Mathematical Morphology: 

Star/Galaxy Differentiation &: Galaxy Morphology Classification 

Jason A. Moore^'^ , Kevin A. Pimbhlet^ , and Michael J. Drinkwater^ 

^ Department of Physics, University of Queensland, Brisbane QLD 4072, Australia 
^ Email: jmoore@pliysics.uq.edu.au 



o 
o 

(N 
> 

o 



> 
(N 

O 



O 

o 



X 



Abstract: 

We present an application of Mathematical Morphology (MM) for the classification of astronomical 
objects, both for star/galaxy differentiation and galaxy morphology classification. We demonstrate 
that, for CCD images, 99.3 ±3.8% of galaxies can be separated from stars using MM, with 19.4 ±7.9% 
of the stars being misclassified. We demonstrate that, for photographic plate images, the number of 
galaxies correctly separated from the stars can be increased using our MM diffraction spike tool, which 
allows 51.0 ± 6.0% of the high-brightness galaxies that are inseparable in current techniques to be 
correctly classified, with only 1.4 ± 0.5% of the high-brightness stars contaminating the population. 
We demonstrate that elliptical (E) and late-type spiral (Sc-Sd) galajcies can be classified using MM at 
an accuracy of 91.4 ± 7.8%. It is a method involving less 'free parameters' than current techniques, 
especially automated machine learning algorithms. The limitation of MM galaxy morphology based 
on seeing and distance is also presented. We examine various star/galaxy differentiation and galaxy 
morhpology classification techniques commonly used today, and show that the above MM techniques 
compare very favourably. 

Keywords: techniques: image processing - methods: data analysis - methods: miscellaneous 



1 Introduction 

The bulk of modern astronomical observations are per- 
formed with charge-coupled devices (CCDs) and hence 
are, almost by default, digital in nature. Also, many 
pre-existing sky surveys compiled with photographic 
plates have been digitized usin g plate measuring fa- 



cilities such as SuperCOSMQS jHambly et al.ll2001al: 
Hamblv. Irwin, fc MacGillivravl l2001bl : iHamblv et all 



200ld ) a nd the Automatic Plate Measuring (APM) 
machine (iKibblewhite et al]ll984l l. This digitization 
of astronomical data has provided the opportunity for 
computational solutions to many image analysis prob- 
lems. Over the past four decades, many digital tech- 
niques have been developed for filtering, pattern recog- 
nition, neural networks, artificial intelligence, and oth- 
ers. Unsurprisingly, many of these techniques have 
found applications the area of astronomical imaging. 

In particular, accurately and uniformly classify- 
ing objects in astronomical images is of great impor- 
tance. One way for this to be achieved is by per- 
forming the classification by eye, however there are 
issues with this. Firstly, eye-ball classification is ex- 
tremely subjective and it is diffic ult to e nsure uni- 
form classification. As an example, IShaveii (jl987l l pre- 
sented evidence for a possible large-scale structure in 
the distribution of quasars out to redshift of about 
0.5, concentrated in the direction of the cosmic mi- 
crowave background dipole . How ever, follow up study 
by Drinkwatcr & Schmidt^ (jl996[ ) concluded that there 
is no evidence for such a concentration, and that the 
earlier result was probably biased by the use of non- 
uniform image classifications. Secondly, most astro- 
nomical observations, particularly those for cosmolog- 



ical studies, seek to compile object catalogues over 
large regions of the sky. The corresponding increase 
in image data (a single photographic plate can contain 
hundreds of thousands of detectable objects) calls for 
the development of fast image processing, recognition, 
and classification by automated means. This problem 
of finding a robust method to cleanly distinguish be- 
tween all astronomical object types is one of the most 
challenging in astronomical image analysis. 

1.1 Star /Galaxy Differentiation 

The problem of separating the galaxy population from 
the stellar population within an astronomical image 
is one with a variety of proposed solutions. In broad 
terms, stars and galaxies can be distinguished from 
each other on the basis of their light profiles. Stars 
are expected to possess a highly peak point-like profile 
(the size of which is given by the point spread func- 
tion of the given image), whilst galaxies tend to be 
more extended in nature. One basic technique is to 
take two parameters which describe an object in an 
image, plot them against each other, and then use the 
line segment that optimally separates the stars from 
the galaxies. Parameters which have been commonly 
used (plotting against the magnitude of the obje ct) in- 
clude: is ophotal area (e.g.lReid fc GilmordI 19821 ), peak 
flux (e.g. iJones et al.lll99lh . or surface brightness (e.g. 
iHarmon fc Mamonlll993l '). 

As an example to consider, to separate stars from 
galaxies in the Las Campanas / AAT R ich Cluster Sur- 
vey (L ARCS) photometric catalo gue. iPimbblet et cd] 
(j200ll ) followed the technique of iReid et all (| 19961 ) . 
by calculating the magnitude difference between 4.0- 
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and 2.0-arcsecond diameter apertures on B-band expo- 
sures. Using these, they test various oth er parameters 
of me rit, including a concentration index (lAb raham et al.l 
Il994l \ full width at half maximum (F WHM), SEx- 
tract or's own estimate of stellarity (Ber tin fc Arnoiit^ 
Il996l \ and ellipticity. Of these parameters, they find 
that both FWHM and SExtractor both possess strong 
reliability and accuracy to separate stars from galaxies. 
SExtractor is very competitive as it offers not only a 
rapid dissemination of results, but has the ability to re- 
liably deblend sources and perform star/galaxy differ- 
entiation in a highly automated fashion, thus making 
the package well-suited to the reduction of large-scale 
galaxy survey data. 

There also exists a plethora of additional param- 
eters which can also be helpful, including: core mag- 
nitude, intensity weighted first moment radius, radius 
of gyrati on, th e iScbok (f979) classifier, the 'mo- 
ment' of iKronI |l980i ) . the Q classi fier oflLefevre et al.] 
(|l986l ). and the V parameter of iMaddox et al.l (|l990h 

A more sophisticated, and less subjective, approach 
is to automate the classification, using any number of 
the above parameters. The classifier can be trained 
on a set of pre-classified objects by performing an au- 
tomated ' machine learning ' algorithm such as decision 
trees (e.g. IWeir et al .11 19951 ) . self organizing maps (e.g. 
Miller fc Coell 996|), artificial neural networks (e.g. 



ing those mentioned above) directly to images. 

There are several issues, however, with using light 
decomposition. Firstly, the de Vaucouleurs profil e does 
not produce the best fit for all bulges (sec Andrcd akis fc SandersI 
Il994l : iMacArthur. Courteau. fc Hohzmanl l2003l Sec- 
ondly, even assuming a m ore general Core-Sersic model 
(jGraham fc Driverl l2005l . and references therein) for 
the bulge, the minimization of at least five parameters 
must be used to determine the 'best' solution, which 
is not always unique . 

IConselicg lj2003h presents an alternative solution 
for classifying galaxy morphology using a three dimen- 
sional 'CAS' volume: the concentration (C), asymme- 
try [A), and dumpiness (5*). They further argue that 
these three parameters correlate with important modes 
of galaxy evolution: gross form, major merger activity, 
and star formation. The definitions and formalism for 
the 'CAS' parameters can be found in Appendix A. 



1.3 Outline 



The plan for the remainder of the paper is as follows. 
In Section[51 the quantitative image analysis technique 
Mathematical Morphology (MM) is presented, includ- 
ing an overview of the operators used in this work. In 
Section 131 the problem of star/galaxy differentiation is 
DdewahnieWtl igated including a comparison of MM with the 



1992l.ll99 3: Bert in fc Arnout Jl996l : lAndreon et al.ll2000l: re sults of current techniques. In Sectiongl the problem 



Phihp et al.l200f r or fuzzy set reasoning (jMahonen fc FranttJ' galaxy morphology classification is investigated 



20001 ') 



SExtractor uses a trained neural network for 



star/galaxy classification. The main problem with this 
technique is the large number of 'free parameters'. Each 
node, the number of nodes, and the layering of the 
nodes are all parameters that need to be set or tweaked 
within an artificial neural network. Additionally, the 
'machine learning' algorithms must be extensively trained 
on pre-classified images which may introduce further 
biases. 

1.2 Galaxy Morphology 
Classification 

Another challenging problem in astronomical image 
analysis is finding a robust method to cleanly distin- 
guish between all major classes of galaxies. While most 
classifications of galaxy morphology have been per- 
formed by eye using the Hubble tuning-fork system , 
and its subsequent extensions (see [ Hubblelll926l ! Il936l : 
Ivan der Bergh|[l960l : ISandage Ill96lf r there still exist a 
few automated classification techniques. 

The most common technique for classifying the 
morphology of a galaxy is through light decomposition. 
This involves fitting a model to the surface brightness 
pro file of the galaxy , typically ba sed on the Exponen- 
tial, [de Vaucouleurs! l| 19481 , 1 19591 ) and/or ISersi"3(|l963l . 
Il968 ) profiles. These models effectively decompose 
the galaxy profile down into bulge (de Vaucouleurs) 
and disk (Exponential) components, making them a 
handy technique for classifying galaxies into morphol- 
ogy types defined by the Hubble tu ning-fork system. 
As an example, GALFIT (|Peng et al..,2002,) is a galaxy 
and point source fitting algorithm that fits two dimen- 
sional parameterized, axisymmetric, functions (includ- 



cluding a comparison of MM with the results of current 
techniques. We summarize our findings in Section [S] 



2 Mathematical Morphology 

MM is a branch of digital image processing and analy- 
sis or i ginati ng from the work of .Mathcron (1975) and 
ISerral(| 19821 ). who worked on a number of problems in 
mineralogy and petrology. They laid down the foun- 
dations of MM, a new approach to quantitative image 
processing. MM has now achieved a status as a power- 
ful method for image processing; with applications in 
material science, microscopic imaging, pattern recog- 
nition, medical imaging, and even computer vision. 

The International Society for Optical Engineering 
(SPIE) now holds an annual conference devoted to 
morphology applications, but astronomy applications 
have been limited. Some of these applications include 
image smoothing (Lea & Keller 1989), removing cir- 

rus emission (jAppleton, Siqueira. fc BasartI 119931: IH^ 

199d) , astro nomical object extraction (|Candeas. Braga Neto. fc Carvalh 
19961 . 1 1997]). and qu antifving simulated galaxy distri- 



butions ( Ue dal ll999l ). 

The central idea of MM is the process of examining 
the geometrical structure of an image by matching it 
with smaller patterns at various locations in the im- 
age. By varying the size and shape of the matching 
patterns, called structuring element^ useful shape in- 
formation can be obtained about the different features 
inside images and their interrelations. Although MM 



^In other disciplines, the structuring element is often 
referred to as a 'kernel'. In this work, however, we will 
stick with the original term. 



www. publish . csiro. au /journals /pasa 



3 



was originally designed to work on only binary images, 
subsequent modification lead to its application to grey- 
scale images. 

2.1 Structuring Elements 

An essential part of the MM morphological operations 
is the structuring element used to probe the input im- 
age. The structuring element is a pattern of discrete 
points relative to some origin, called the active centre. 
It is often chosen to be a circle or line segment, but 
other choices are possible depending on the particular 
application. This however should not be viewed as a 
limitation, since it actually leads to additional flexibil- 
ity in algorithm design. 

One of the most common structuring elements for 
best representing a circle is the diamond shape, cre- 
ated using the city-block metric0. We adopt this as 
the most appropriate structuring element for the as- 
tronomical applications in this paper, with the features 
of interest being approximately circular in shape. Fig- 
ure [1] contains 3 examples of the diamond structuring 
element, illustrating the start of an increasing series. 
The usefulness of a monotonically increasing series of 
structuring elements is given in Section [2.31 




Figure 1 : This figure illustrates an increasing series of 
diamond structuring elements, created using the city- 
block metric. These 3x3 (upper- left), 5x5 (upper- 
right) and 7x7 (bottom) structuring elements are the 
first three in the series. The shaded regions (both solid 
and hatching) indicate the grid contained within the 
structuring elements, with the solid fill representing 
the active centres. 

■^A diamond can be defined on a grid by the city-clock 
metric, d{x, y) = \x\ + \y\, where x and y are the grid coor- 
dinates relative to the active centre. Positions with d{x, y) 
less than a defined radius are included in the shape. 



2.2 Operators 

In MM there are two elementary morphological opera- 
tors: dilation and erosion, from which the other oper- 
ators and tools can also be formed. Here, we give just 
a brief summary of each operator, but a more detailed 
description of both the operation of, and the theoreti- 
cal basi s for, these basic operators ca n be found in the 
work of iHeiimanj l|l99l 1 1994 Il995l l . 

2.2.1 Dilation Operator 

Dilation is the process of brightening, expanding, and 
growing regions around the positive pixels of an image, 
I, using a structuring element, S. To perform this op- 
eration, the structuring element slides over the image. 
For each position that the structuring element is at, 
the maximum value contained within the structuring 
element (superimposed upon the input image) is used 
as the pixel value for the output image: 

{I®S)x,y= max (/^+i,a+j) (1) 

(i,j)SS 

The bottom panel in Figure [2] shows a dilated im- 
age. Note the outward expansion of the shape that 
results from the operation. 

2.2.2 Erosion Operator 

Erosion is complementary to, but the opposite of di- 
lation. Instead of expanding, and growing regions of 
the image, erosion shrinks and wipes them out. The 
process is exactly the same as for the dilation oper- 
ator, save that instead of calculating the maximum 
pixel value contained within the structuring element, 
it calculates the minimum: 

(/ e S):^,y = min (^+i,j;+^) (2) 

The upper-right panel in Figure [2] shows an eroded 
image. Note the inward contraction of the shape that 
results from the operation. 

2.2.3 Opening Operator 

Opening is the process of separating and isolating small 
regions of an image from larger ones, and simultane- 
ously eliminating information contained on small scales. 
By sequencing an erosion followed by a dilation, the 
opening operator can be formed: 

ios^{ies)®s (3) 

The opening operator also has the property of be- 
ing idempotent: 

{IoS)oS = IoS (4) 

This means that no more changes will occur to an 
image, even if the opening operation is repeated mul- 
tiple times. The opening operator is called a morpho- 
logical filter from this property, as well as it being an 
increasing morphological operator; that is, X C y im- 
plies that X o S o S . The middle panel in Figure [3] 
shows two opened astronomical images, one of a spiral 
galaxy and one of a group of elliptical galaxies. Note 
the elimination of small scale features, of the same size 
scale as the structuring element used, from the image. 
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Figure 2: The result of performing the dilation and 
erosion operations on the same shape, using a 3 x 3 
diamond structuring element. The upper- left panel 
shows the original shape. The bottom panel shows the 
dilated shape as the outline and the original shape as 
the solid fill. The upper-right panel shows the eroded 
shape as the solid fill and the original shape as the 
outline. 

2.2.4 Closing Operator 

The closing operator is the opposite of the opening 
operator, in a similar way as dilation is to erosion. 
Closing is the process of fiUing-in holes in the pixel 
distribution, and joining together previously separate 
image features. Mathematically, it is a dilation fol- 
lowed by an erosion: 

i»s = {i®s)es (5) 

The closing operator shares the same basic features 
of the opening operator such as being idempotent and 
working as a morphological filter. The bottom panel in 
Figure |3] shows two closed astronomical images, one of 
a spiral galaxy and one of a group of elliptical galaxies. 

2.2.5 Gradient Operator 

The elementary morphological operators can be com- 
bined together to detect any edges (i.e. sharp gradi- 
ents) around shapes in an image, as well as the curva- 
ture of these objects. The gradient operator is formed 
by combining the dilation and erosion operators; 

Vs/ = (/ffiS)-(/e5) (6) 
2.3 Granulometry 

Granulometry uses the property of morphological op- 
erators that they can be used to remove (or enhance) 




Figure 3: The result of performing openning and clos- 
ing operations on a spiral galaxy (left) and a group of 
elliptical galaxies (right). The top row shows the orig- 
inal astronomical images. The middle row shows the 
opened images, with small scale structures - on the 
order of the structuring element - being eliminated in 
both. The bottom row shows the closed images, with 
gaps between features being filled in. 
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artifacts in an image of a certain size and shape, analo- 
gous to the sieving of rocks by sequentially using sieves 
of progressively larger or smaller sizes. Granulometry 
consists of a sequence of closing (or opening) opera- 
tions using an increasing series of structuring elements. 
By measuring the volume under the image after suc- 
cessive closings, we can build a size distribution curve: 



$(A) = 



V{A)-V{0)' 



A > 



(7) 



where A is the parameterization of the series of clos- 
ing operations, ^(A) is the volume of the image at 
each iteration, and A is the parameter associated to 
the largest structuring element (selected to be the one 
large enough to 'wipe out' the object of interest). 

The size-distribution curve is monotonically increas- 
ing, so it may be considered as a cumulative probabil- 
ity distribution. The associated probability density 
function is called the pattern spectrum, which is given 
by the following discrete derivative: 



r(A) = $(A-h 1) - 'l'(A),A > 



(8) 



We can define a useful shape analysis attribute, the 
average size, which is the expected value of the pattern 
spectrum: 



A = ^Ar(A) 



(9) 



Another very useful attribute is the average rough- 
ness of the pattern spectrum, which is identical to the 
equation for average uncertainty (entropy) in informa- 
tion theory: 



e = -^r(A)iog[r(A)] 



(10) 



Figure |4] illustrates how the pattern spectrum will 
look for typical spiral nd elliptical galaxies. Also in- 
cluded on the plots are the values of average size and 
average roughness. 

Of the MM tools derived, the average size. A, and 
the average roughness, 9, are a powerful combination 
for characterizing shape and size in image analysis 
(see iMaragosI 119891 ) , providing statistical information 
about the shape content of the image. In this work, 
we present an implementation of basic MM operators 
and applications of MM in two astronomical object 
classification problems: star/galaxy differentiation and 
galaxy morphology classification. We expect MM to be 
superior to many of the current techniques for astro- 
nomical object classification, with the benefit of only 
using one 'free parameter', the structuring element, 
compared with at least five (ellipse fits) for the tech- 
niques mentioned in Section [1.11 and Section [1.21 



AVERAGE ROUGHNESS - 0.9 
AVERAGE SIZE = 4.07 




Figure 4: Pattern spectrum plots for CCD images 
of typical spiral (top) and elliptical (bottom) galaxies. 
The pattern spectrum represents the distribution of 
size within an image, based on the shape of the struc- 
turing element. The dotted vertical lines on the plots 
indicate the average roughness, and the full vertical 
lines represent the average size. 



3 Star/Galaxy Differentiation 

To test the advantage of using MM to differentiate be- 
tween stars and galaxies, both CCD and photographic 
plate images are used. 
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CCD chips possess many qualities that make them 
superior to photographic plates, including a large dy- 
namic range, higher quantum efficiency, a higher linear 
response, and relatively low noise levels. Photographic 
plates can have a larger surface area, making them per- 
fect for capturing a wide field-of-view, however they 
have have drawbacks such as the low sensitivity and 
non-linear response of the plates, saturation of bright 
objects, high surface brightness limits, and possible 
inconsistencies that exist between different plates in 
large surveys (such as the SuperCOSMOS Sky Survey 
or the APM Sky Catalogue). 

The major disadvantage in using photographic plates 
in astronomy is the saturation of bright objects, such 
as stars or bright galaxies. The profile of a saturated 
star becomes extended, similar to that of a galaxy. 
This makes it difficult to differentiate stars from galax- 
ies based on their light profiles. CCD images, how- 
ever, maintain most of the original light profile, giving 
them an advantage in star/galaxy differentiation. Us- 
ing both astronomical image types allows us to test 
the limitations of the various classification techniques. 

3.1 Classifier Performance 

We focus on the separation of the galaxy population, 
which is most important in, for example, galaxy sur- 
veys and cosmological studies. Therefore, we rate the 
performance of a star/galaxy classifier based on the 
following three criteria: 

1. Completeness — the percentage of galaxies cor- 
rectly classified as such. 

2. Contamination — the percentage of stars incor- 
rectly classified as galaxies. 

3. Correctness — the overall percentage of objects 
correctly classified as stars or galaxies. 

Rating a star/galaxy classifier by these criteria, 
particularly the first two, will give us an indication of 
how well it extracts the galaxy population (e.g. for fol- 
low up spectroscopic study) without having too many 
stars contaminate the sample. 

In this work, a multi-layer perceptron (MLP) is 
used as the automated classifier to separate the stars 
from the galaxies for a given star /g alaxy classification 
technique. MLP (see lHavkinl ll99^1 is the most com- 
mon neural network, and has been applied to many 
complex pattern recognition problems. It is composed 
of a layering of nodes (or perceptrons) whose output 
is determined by a linear combination of the previ- 
ous nodes put through a nonlinear activation function. 
The weighting factors within the linear combination 
are varied until the optimal solution for a given set of 
training data is found. Based on the number of layers, 
and nodes in each layer, any function can be modeled. 
In this work, however, we have reduced the number of 
network nodes to the minimum to perform basic linear 
classification. 



3.2 CCD Images 

The CCD images are taken from the data archive of 
ISmail et al.l (|1997| ) , who have produced a classified cat- 



alogue of morphological types of 1857 astronomical ob- 
jects within 10 intermediate-redshift (0.37 < z < 0.56) 
rich galaxy cluster fields, observed using the Wide- 
Field Planetary Camera 2 (WFPC2) on the Hubble 
Space Telescope (HST). They classify each detectable 
object within the field by eye, according to the Re- 
vised Hubble scheme — as either a star, spheroidal 
(E/SO), early-type spiral (Sa-Sb), late-type spiral (Sc- 
Sd), barred spiral (SBa-SBd) or irregular (Irr/Sm) galaxy. 
The reliability of these classifications is hard to quan- 
tify, but is considered by the authors to be accurate to 
within one class or better in at least 80% of cases. The 
objects in the sample range in magnitude from 23.5 at 
the faint end to magnitude 15.5 at the bright end, over 
the filter bands B (F450W), V (F555W), R (F702W), 
and I (F814W). 

We use this catalogue, both the HST images and 
the object segmentation generated using SExtractor, 
to test the effectiveness of the various classification 
techniques to differentiate between stars and galaxies 
in CCD images. We limit our results to those taken 
with the _R-band filter (725 classified objects). 

For these images, magnitude, aperture area, sur- 
face brightness (SB), and the MM tools, average size 
and average roughness, are used to differentiate be- 
tween pre-classified stars and galaxies. Before calcu- 
lating average size and average roughness, an MM gra- 
dient operation is performed to enhance sharp edges, 
increasing the difference between star and galaxy light 
profiles. To calculate average size and average rough- 
ness, an increasing series of diamond structuring ele- 
ments is used for reasons given in Section [2. II 

Figure [5] illustrates how magnitude, aperture area, 
and surface brightness can be used to differentiate be- 
tween stars and galaxies in CCD images, and the re- 
sults of using MM. In each plot, the line segment used 
to separate stars from galaxies is included — which 
has been constructed using an MLP. These results are 
summarized in Table [T] 



Table 1: Star/Galaxy Differentiation results for 
CCD images". 



Method 


Comp. (%) 


Cent. (%) 


Corr. (%) 


Mag-Area 


99.4 ±3.8 


16.1 ±7.2 


98.8 ±3.7 


Mag-SB 


99.4 ±3.8 


16.1 ±7.2 


98.8 ±3.7 


MM 


99.3 ±3.8 


19.4 ±7.9 


98.5 ±3.7 



° Using 725 objects — 31 stars and 694 galaxies. 



3.3 Photographic Plate Images 

iDovle et all (|2005t ) u se 4315 fields from the SuperCOS- 

MPS Image Archive llHamblv et al.ll2001al : iHamblv. Irwin, fc MacGiUivr 
l2001bl : [Hamblv et al.ll2001cD in their work on findmg 
optical counterparts for the HI Parkes All Sky Sur- 
vey (HIPASS). We use the same photographic plate 
images, along with the object segmentation generated 
using SExtractor, to test the effectiveness of using the 
various classification techniques to differentiate between 
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AVERAGE ROUGHNESS 

Figure 5: Star/galaxy classification plots for CCD 
image objects, between i?-band magnitude 23.5 and 
15.5. The upper two panels show the results of us- 
ing current techniques for the classification: the top 
panel showing the classification using magnitude and 
aperture area, and the middle panel showing the classi- 
fication using magnitude and surface brightness. The 
bottom panel shows our classification using two pa- 
rameters from MM — average size and average rough- 
ness. The classification line in each panel has been 
constructed using an MLP neural network. The clas- 
sification performance of MM is comparable to that 
of current techniques, in separating stars and galaxies. 
These results are summarized in Table [1] 



stars and galaxies in photographic plate images. We 
classify each detect able object w i thin t he field by eye 
in a similar way to ISmail et al.l (|l997l ). The objects 
in the sample range in i?-band magnitude from 20.0 
at the faint end to magnitude 13.0 at the bright end. 
For these images, magnitude, aperture area, surface 
brightness (SB) and peak flux (PF) are used to differ- 
entiate between the pre-classified stars and galaxies. 

Figure [6] illustrates how magnitude, aperture area, 
surface brightness and peak flux can be used to dif- 
ferentiate between stars and galaxies in photographic 
plate images. The magnitude is calculated using the 
integrated pixel flux over the aperture, the exposure 
time, and the photometric zeropoint. Hence, objects 
which have saturated to the maximum pixel value will 
have their magnitudes under-estimated. In each plot, 
the line segment used to separate stars from galaxies 
is included — which has been constructed using an 
MLP neural network. These results are summarized 
in Tabled 

For the photographic plate images, MM cannot be 
used to classify stars and galaxies in the same way that 
it can for CCD images. The 'peakiness' that MM is 
able to detect in the CCD images does not exist in the 
photographic images, due to the non-linear response 
of the plate. 

The biggest difficulties for current techniques are 
differentiating between stars and galaxies at the high- 
brightness limit and low brightness limit. This high- 
brightness limitation, best illustrated in Figure [S]( mid- 
dle panel), results from the saturation of the stars' pro- 
files, causing them to become extended like the profile 
of a galaxy. The low brightness limitation results from 
the resolution of the image, where there just are not 
enough pixels to accurately characterize the object. 

We use the tools of MM to calculate the sizes of 
the diffraction spikes, which become substantial in the 
stars brighter than magnitude 16.5. The power of 
the structuring element in MM is the ability to match 
against features within an image. We use an increasing 
series of structuring elements in the shape of a diffrac- 
tion spike (plus sign), and the technique of granulom- 
etry, to calculate the diffraction spike length. This 
feature presents a method for differentiating galaxies 
from bright stars. Figure [7] illustrates how this has 
been achieved, with the objects in this plot taken from 
the region in Figure |6] (middle panel) where distinc- 
tion between stars and galaxies is difficult, with mag- 
nitudes brighter than 17.0 and surface brightnesses 
brighter than 20.9 magnitudes per square arcsecond. 
The line segment used to separate stars from galax- 
ies is included — which has been constructed using an 
MLP neural network. These results are included in 
Tabled 

3.4 Results 

For CCD images, 99.3 ± 3.8% of galaxies can be sep- 
arated from stars using MM, with 19.4 ± 7.9% of the 
stars being misclassified, comparing very favourably 
with current techniques. This result, along with the 
fact that MM requires only one 'free parameter' com- 
pared with at least five for techniques involving aper- 
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13; 

20.4 20.6 20.8 21 21.2 21.4 21.6 21.8 22 22.2 
SURFACE BRIGHTNESS (magnitudes per square arcsecond) 





1 1.2 1.4 1.6 l.f 
DIFFRACTION SPIKE /B 



Figure 7: Diffraction spike length, calculated using 
MM, for the bright stars and galaxies found in the 
inseparable region of Figure [H] (middle panel). The 
classification line has been constructed using an MLP 
neural network. Galaxies that are unable to be sep- 
arated from bright stars (using current techniques 
alone) can be recovered using the MM diffraction spike 
tool. These results are included in Table [2] 



0.2 0.4 0.6 0. 



Table 2: Star/Galaxy Differentiation results for 
photographic plate images". 



Figure 6: Star/galaxy classification plots for photo- 
graphic plate objects, between B-band magnitude 20.0 
and 13.0. The top panel shows the classification us- 
ing magnitude and aperture area. The middle panel 
shows the classification using magnitude and surface 
brightness. Notice that the classification line has ro- 
tated due to low-brightness and high-brightness mix- 
ing. The bottom panel shows the classification using 
magnitude and peak fiux. The classification line in 
each panel has been constructed using an MLP neural 
network. In these three plots, stars brighter than mag- 
nitude 16.5 are completely inseparable from the galaxy 
population. These results are summarized in Table [2l 



Method 
Mag-Area 
Mag-SB 
Mag-PF 



Comp. (%) Cont. (%) Corr. (%) 



22.4 ±4.0 
22.4 ±4.0 
21.4 ±3.9 



1.6 ±0.5 
1.6 ±0.5 
1.2 ±0.5 



83.2 ±3.4 
83.2 ±3.4 
83.4 ±3.4 



Using 713 high-brightness objects 
and 143 galaxies. 



570 stars 
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tures, presents MM as a powerful method for star/galaxy 
differentiation in CCD images. 

For photographic plate images, conventional tech- 
niques fail to differentiate between stars and galaxies 
at the high-brightness limit. A solution to this will 
either involve human intervention or a complex algo- 
rithm. MM provides provides a far less complex solu- 
tion, involving less 'free parameters' than other com- 
plex alternatives. Using the MM diffraction spike tool 
allows 51.0 ± 6.0% of the galaxies that are inseparable 
in Figure |6] (middle panel) , with magnitudes brighter 
than 17.0 and surface brightnesses brighter than 20.9 
magnitudes per square arcsecond, to be correctly sep- 
arated from the bright stars. 

4 Galaxy Morphology 
Classification 

To test the advantage of using MM to classify galaxy 
morphology we now exclusively use CCD imaging. The 
C CD images a r e aga in taken from the data archive 
of ISmail et all (|1997| ). It must be noted that pecu- 
liar and merging galaxies are not included in this cat- 
alogue, so the ability of classification techniques to 
discriminate between such galaxy types is not able 
to be determined. For the images, the techniques of 
Sersic fitting (using the GALFIT software packag43), 
the 'CAS' parameters, and the MM tools, average size 
and average roughness, are used to differentiate be- 
tween pre-classified elliptical (E) and late-type spi- 
ral (Sc-Sd) galaxies. Before calculating average size 
and average roughness, an MM close operation is per- 
formed to average any spiral arm features into a disk 
component. To calculate average size and average rough- 
ness, an increasing series of diamond structuring ele- 
ments is used for reasons given in Section [2. II 

Figure [8] illustrates how the Sersic index, n, and 
Figure |5] illustrates how the 'CAS' parameters can be 
used to classify elliptical and late-type spiral galaxies. 
In each plot, the line segment used to separate ellip- 
tical galaxies from late-type spiral galaxies is included 
— which has been constructed using an MLP neural 
network. The galaxies with a core size less than 4 pix- 
els were excluded from the calculation. We found a 
severly undersampled core can create difficulties with 
the dumpiness and asymmetry parameters, not pro- 
viding a fair comparison against the other techniques. 
These results are summarized in Table [3] 

Figure [Tol shows how MM can be used to classify el- 
liptical and late-type spiral galaxies. The line segment 
used to separate elliptical galaxies from late-type spi- 
ral galaxies is included — which has been constructed 
using an MLP neural network. These results are in- 
cluded in Table H 

To test the effect of atmospheric smoothing (or 
seeing) on the performance of the MM galaxy mor- 
phology classification, the HST images are artificially 
blurred using the IRAF Gaussian task from space- 
based (FWHM of 0.1 arcsec) to ground-based (FWHM 
of 2.12 arcsec) seeing conditions. The left panel of 




LOG SERSIC INDEX (n) 



Figure 8: Galaxy morphology classification plot for 
Sersic fitting, using the elliptical (E) and l ate- type spi- 
ral (S c-Sd) galaxies in the data archive of lSmail et al.l 
(jl997l l. The two dotted lines represent the de Vau- 
couleurs (bulge, n = 4) and Exponential (disk, n = 1) 
profiles. The full line represents the classification line, 
which has been constructed using an MLP neural net- 
work. This plot demonstrates the classification per- 
formance of Sersic fitting, using the GALFIT software 
package, in separating elliptical and late-type spiral 
galaxies between magnitude 23.5 and 16.0. These re- 
sults are summarized in Table |3] 



Table 3: Galaxy Morphology Classification results 
for CCD images". 



Method 


Correctness 


%) 


Sersic Fitting (GALFIT) 


89.8 ± 


6.3 


Asymmetry-Concentration 


84.2 ± 


7.4 


Asymmetry— dumpiness 


80.9 ± 


7.3 


Clumpiness-Concentration 


79.6 ± 


7.2 


MM 


91.4 ± 


7.8 



" Using 152 objects — 71 elliptical (E) and 81 
late-type spiral (Sc-Sd) galaxies. 



^http: / / zwicky. as. arizona. edu / ~cyp / work / galfit / galfit . html 
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Figure 9: Galaxy morphology classification plots for 
the 'CAS' parameters, using the elliptical (E) and 
la te-type spira l (Sc-S d) galaxies in the data archive 
of lSmail et aP l|l997l ). The classification line in each 
panel has been constructed using an MLP neural net- 
work. The plots demonstrate the classification perfor- 
mance of the three 'CAS' parameters, in separating 
elliptical and late-type spiral galaxies between mag- 
nitude 23.5 and 16.0. These results are included in 
Table El 




1 1.1 1.2 1.3 

AVERAGE ROUGHNESS 



Figure 10: Galaxy morphology classification plot for 
MM, using the elliptical (E) and late-type spi r al (Sc - 
Sd) galaxies in the data archive of ISmail et al.l (jl997l ). 
The plot shows galaxy morphology classification using 
two parameters from MM — average size and average 
roughness. The classification line has been constructed 
using an MLP neural network. The classification per- 
formance of MM is comparable to current techniques, 
in separating elliptical and late-type spiral galaxies be- 
tween magnitude 23.5 and 16.0. These results are in- 
cluded in Tabled 



Figure [TT] shows how the classification line is altered 
as the seeing increases, shifting toward higher aver- 
age size values as the galaxy profiles extend due to 
the blurring. The right panel of Figure [TT] shows how 
the classification performance of MM is reduced as the 
seeing increases. 

4.1 Results 

For CCD images, 91.4±7.8% of elliptical and late-type 
spiral galaxies can be correctly classified using MM, 
comparing very favourably with current techniques. 
These results present MM as a powerful alternative 
for galaxy morphology classification in CCD images. 

Atmospheric seeing effects the MM galaxy mor- 
phology classification by shifting the classification line 
toward higher average size values and reducing the cor- 
rectness of the classification. The performance of the 
classification is acceptable (at least 80%) for seeing 
FWHM equal to or better than 0.4 arcseconds. For a 



redshift of 0.465 ± 0.095 {Da = 1193.1 



+ 122.4 



Mpc; for 



Ho = 71, Qm = 0.3, Q,A = 0.7) this corresponds to a 
physical size of 2.19lg'29 '^P^- This size is of the same 
order as the central bulge of each galaxy and, therefore, 
the Gaussian seeing profile will become a significant el- 
ement of the galaxy profiles. The difference between 
the profiles of elliptical and spiral galaxies is reduced 
with seeing sizes even larger, preventing MM from be- 
ing able to distinguish between the two classes. From 
this we can write the following seeing-distance rela- 
tionship — MM will classify galaxy morphology at a 
performance of 80% or greater if: 



sxd< 477.24; 



-48.96 
-60.84 



(11) 
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Figure 11: Effect of atmospheric smoothing on the 
MM galaxy morphology classification. The top panel 
shows the change in the classification line between 
space-based (FWHM of 0.1 arcsec) to ground-based 
(FWHM of 0.54 arcsec) seeing conditions. The bot- 
tom panel shows the reduction in the correctness of 
the classification between space-based (FWHM of 0.1 
arcsec) to ground-based (FWHM of 2.12 arcsec) see- 
ing conditions. The performance is acceptable (at least 
80%) for seeing better than 0.4 arcsec. 



where s is the seeing FWHM (in arc seconds) and d is 
the cosmological angular size distance (in Mpc). The 
consequence of this is that, for a ground-based tele- 
scope with a 1.0 arcsecond seeing FWHM, the MM 
classification is acceptable to a maximum distance of 
477.24+g^;^^ Mpc {z = 0.13 ±0.02). 

The difficulty remains to find what physical pro- 
cesses the average size and average roughness will cor- 
relate wit h . For example, using the 'CAS' framework, 
IConsehcd (|2003l ) use some quantitative reasoning to 
show that the concentration index correlates with gross 
form (bulge to disk ratio), asymmetry correlates with 
recent merger/interaction activity, and dumpiness cor- 
relates with star formation (using the Ha equivalent 
widths). We are currently investigating the relation- 
ship between average size and average roughness with 
respect to star formation rate in galaxies. This will 
provide correlation with both internal and external 
galaxy mechanisms, and will be presented in a future 
publication. 



5 Summary 



We have demonstrated that, for CCD images, 99.3 ± 
3.8% of the galaxy population is separated from the 
stellar population using MM, with 19.4 ± 7.9% of the 
stars being misclassified, comparing very well with cur- 
rent techniques involving aperture magnitudes. We 
have also demostrated that, for photographic plate im- 
ages, the MM diffraction spike tool allows 51.0 ±6.0% 
of the bright galaxies, that are inseparable from the 
bright stars in current techniques, to be correctly clas- 
sified, with only 1.4±0.5% of the high-brightness stars 
contaminating the population. Lastly, we have demon- 
strated that elliptical (E) and late-type spiral (Sc-Sd) 
galaxies can be classified using MM at an accuracy of 
91.4 ± 7.8%, which compared well with the techniques 
of Sersic fitting and the 'CAS' parameters. 

What we have presented here is an automated method, 
alternative to current techniques, for both star/galaxy 
differentiation and galaxy morphology classification. 
It is a method involving less 'free parameters' than cur- 
rent techniques, especially automated machine learn- 
ing algorithms, and doesn't require a machine learning 
algorithm for the training of the classifier. The clas- 
sification tools developed in this work could, however, 
also be used as an additional layer of parameter space 
for any current machine learning approaches. 

We intend to use this method to study the rela- 
tionship between galaxy morphology and star forma- 
tion rate, and how each vary within the supercluster 
environment. This environment allows us to probe 
the complete range of densities at which galaxies are 
affected by star formation suppression and morphol- 
ogy alteration mechanisms. We shall study how these 
galaxy properties vary between dense cluster cores, 
inter-cluster filaments, and the low-density 'field' re- 
gions. 
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High Spatial Frequency 

The dumpiness parameter, S, is defined as l|Conselicel 
the amount of light contained in high spatial 
frequency structures to the total amount of light in 
the galaxy. Computationally, this is defined as the 
volume of the image of a galaxy which has been blurred 
(smoothed) using a filter and then subtracted from its 
pre-smoothed image, then normalized to the original 
image volume: 



s- 



EI 



(14) 



where / is the ori ginal imag e and B is the blurred 
(smoothed) image. " IConselicel (|2003l ) use a smoothing 
filter of size i x 1.5 x r{ri = 0.2) and exclude the region 
inside ^ x 1.5 x r{ri = 0.2), as its high-frequency power 
is unrelated to the stellar light distribution. They also 
force any negative pixel values in the difference image, 
I — B, to zero before computing the dumpiness. 



Appendix A: 
'CAS' Volume 

Here, we give just a brief summary of each parameter, 
but a more detailed description of the operation of 
these can be found in the work of IConselica (|2003i ). 

Concentration of Light 

The c oncentration index, C, is defined as (|Bershadv. Jangren. fc Conselicel 
I2OOOI I the ratio of the 80% to 20% curve of growth radu, 
normalized by a logarithm: 

C7 = 5xlog(^) (12) 
''20% 

where represents the size of the aperture which 
contains x% of the total light flux of the galaxy (in 
some implementations the 75% and 25% curve of growth 
radii are used instead). 

Asymmetry 

The asymmetry index, A, is defined as (jConselice. Bershadv. fc JangrenI 
I2OOOI I the volume of the image of a galaxy which has 
been rotated 180° around its center and then sub- 
tracted from its pre-rotated image, then normalized 
to the original image volume: 

A (13, 

whe re I is the origina l image and R is the rotated im- 
age. IConselic^ l|2003t ) first reduce the effective resolu- 
tion of the image, 7, using a filter of size i x r{ri = 0.2), 
to have asymmetry only sensitive to large-scale stellar 
distributions. 



